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PAGE: I RAW SEQUENCE LISTING DATE: 1 1/04/97 

PATENT APPLICATION VS/08/826,361A TIME: 14:59:21 

INPUT SET: S2l390.mw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING 

2 ' V 7 A ^ 

3 (1) General Information: ^ 

4 " ; 

5 (i) APPLICANT: Mosselman, Sieste 

6 Dijkema, Rein 
7 

8 (ii) TITLE OF INVENTION: Novel estrogen receptor 
9 

10 (iii) NUMBER OF SEQUENCES: 28 

11 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: Akzo Nobel Patent Dept. 

14 (B) STREET: 1300 Piccard Drive, Suite 206 

15 (C) CITY: Rockville 

16 (D) STATE: Maryland 

17 (E) COUNTRY: US 

18 ( F ) ZIP: 20850 
19 

20 (v) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Floppy disk 

22 (B) COMPUTER: IBM PC compatible 

23 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

24 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
25 

2 6 (vi) CURRENT APPLICATION DATA: 

27 (A) APPLICATION NUMBER: US 08/826,361 

28 (B) FILING DATE: 26-MAR-1997 

29 (C) CLASSIFICATION: 
30 

31 (viii) ATTORNEY/AGENT INFORMATION: 

32 (A) NAME: Gormley, Mary E. 

3 3 (B) REGISTRATION NUMBER: 34,40 9 
34 

3 5 (ix) TELECOMMUNICATION INFORMATION: 

36 (A) TELEPHONE: 301-948-7400 

37 (B) TELEFAX: 301-948-9751 
38 
39 

40 ; 

41 (2) INFORMATION FOR SEQ ID NO: 1: 

42 + 

4 3 (i) SEQUENCE CHARACTERISTICS: 

44 ■ (A) LENGTH: 1434 base pairs 

45 ? c (B) TYPE: nucleic acid 

46 * (C) STRANDEDNESS: double 



■fompfriiiiii 
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PAGE: 2 RAW SEQUENCE LISTING DATE: 1 1/04/97 

PATENT APPLIC ATION US/08/826,361A TIME: 14:59:23 

INPUT SET: S2 1390. raw 

4 7 (D) TOPOLOGY: linear 

48 

4 9 (ii) MOLECULE TYPE: cDNA 
50 

51 
52 

5 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
54 

5 5 ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 6 0 

56 

5 7 ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 12 0 

58 

5 9 CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180 
60 

61 GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 24 0 

62 

6 3 GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300 
64 

6 5 GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 36 0 

66 

6 7 AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 42 0 

68 

6 9 GATAAAAACC GGCGCAAGAG CTGCC AGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 4 80 
70 

71 ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 54 0 

72 

7 3 AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 60 0 
74 

7 5 CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 66 0 

76 

7 7 GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 7 20 

78 

7 9 ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 7 80 
80 

81 AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 84 0 

82 

8 3 TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 90 0 
84 

85 CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 96 0 

86 

8 7 CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 10 20 

88 

8 9 CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 10 80 
90 

91 GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 114 0 

92 

9 3 ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 12 0 0 
94 

95 CGCCTGGCTA ACCTCCTGAT GCTCCTGT0C CACGTCAGGC ATGCGAGTAA CAAGGGCATG 126 0 

96 

97 GAACATCTGC TCAACATGAA GTGCAAAA^T GTgGTCCCAG TGTATGACCT GCTGCTGGAG 13 20 

98 y 

9 9 ATGCTGAATG CCCACGTGCT TCGCGGGTJGC AAGTCCTCCA TCACGGGGTC CGAGTGCAGC 13 80 



PAGE: 3 RAW SEQUENCE LISTING DATE: 1 1/04/97 

PATENT APPLICATION US/08/826,361A TIME: 14:59:25 

INPUT SET: S2 1390. raw 

100 

101 CCGGCAGAGG ACAGTAAAAG CAAAGAGGGC TCCCAGAACC CACAGTCTCA GTGA 14 34 

102 

103 (2) INFORMATION FOR SEQ ID NO: 2: 
104 

105 (i) SEQUENCE CHARACTERISTICS: 

106 (A) LENGTH: 1251 base pairs 

107 (B) TYPE: nucleic acid 

108 (C) STRANDEDNESS : double 

109 (D) TOPOLOGY: linear 
110 

111 (ii) MOLECULE TYPE: cDNA 

112 

113 

114 

115 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

116 

117 ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60 
118 

119 ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120 
120 

121 CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180 
122 

123 GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 24 0 

124 

125 GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300 
126 

127 GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 36 0 

128 

129 AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 4 20 

130 

131 GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 4 80 

132 

133 ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540 
134 

135 AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600 
136 

137 CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660 
138 

139 GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720 
140 

141 ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 7 80 

142 

14 3 AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 84 0 

144 

14 5 TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900 

146 > » 

147 CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 96 0 
148 

14 9 CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020 



x 



150 

151 CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080 

152 T . ; 
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RAW SEQUENCE LISTING DATE: 1 1/04/97 

PATENT APPLICATION US/08/826,361A TIME: 14:59:28 

INPUT SET: S2 1390. raw 

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140 
ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200 
CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTG A 1251 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp 
15 10 15 

Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His 
20 25 30 

Asn Asp Tyr lie Cys Pro Ala Thr Asn Gin Cys Thr lie Asp Lys Asn 
35 40 45 

Arg Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val 
50 55 60 

Gly Met 
65 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

■t « 

Leu Val Leu Thr Leu.'L eu Glii Ala Glu Pro Pro His Val Leu lie Ser 

1 5 \ 10 15 




PACK: 5 RAW SEQUENCE LISTING DATE: 1 1/04/97 

PATENT APPLICATION US/08/826,361A TIME: 14:59:30 

INPUT SET: S2 1390. raw 

206 

207 Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr 

208 20 25 30 
209 

210 Lys Leu Ala Asp Lys Glu Leu Val His Met lie Ser Trp Ala Lys Lys 

211 35 40 45 
212 

213 lie Pro Gly Phe Val Glu Leu Ser Leu Phe Asp Gin Val Arg Leu Leu 

214 50 55 60 
215 

216 Glu Ser Cys Trp Met Glu Val Leu Met Met Gly Leu Met Trp Arg Ser 

217 65 70 75 80 
218 

219 lie Asp His Pro Gly Lys Leu lie Phe Ala Pro Asp Leu Val Leu Asp 

220 85 90 95 
221 

222 Arg Asp Glu Gly Lys Cys Val Glu Gly lie Leu Glu lie Phe Asp Met 

223 100 105 110 
224 

225 Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Leu Gin His Lys 

226 115 120 125 
227 

228 Glu Tyr Leu Cys Val Lys Ala Met lie Leu Leu Asn Ser Ser Met Tyr 

229 130 135 140 
230 

2 31 Pro Leu Val Thr Ala Thr Gin Asp Ala Asp Ser Ser Arg Lys Leu Ala 

232 145 150 155 160 

233 

234 His Leu Leu Asn Ala Val Thr Asp Ala Leu Val Trp Val lie Ala Lys 

235 165 170 175 
236 

237 Ser Gly lie Ser Ser Gin Gin Gin Ser Met Arg Leu Ala Asn Leu Leu 

238 180 185 190 
239 

240 Met Leu Leu Ser His Val Arg His Ala Ser Asn Lys Gly Met Glu His 

241 195 200 205 
242 

243 Leu Leu Asn Met Lys Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu 

244 210 215 220 
245 

246 Leu Glu Met Leu Asn Ala His Val Leu 

247 225 230 
248 

24 9 (2) INFORMATION FOR SEQ ID NO: 5: 
250 

251 (i) SEQUENCE CHARACTERISTICS: 

25 : 2 «> (A) LENGTH: 477 amino acids 

253 (B) TYPE: amino acid 

2m (C) STRANDEDNESS: single 

255 (D) TOPOLOGY: unknown 

2|6 

2^7 (ii) MOLECULE TYPE: protein 

258 



% * 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/826,361A 



DATE: 11/04/97 
TIME: 14:59:33 



INPUT SET: S21390.raw 



Original Text 
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